Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions
Abstract
Time difference of arrival (TDOA)-based algorithms are common methods for speech source localization. The generalized cross-correlation (GCC) method is the most important approach for estimating the TDOA between microphone pairs. The performance of this method degrades significantly in the presence of noise and reverberation. This paper addresses the problem of 3D localization in joint noisy and reverberant conditions for a single-speaker scenario. We first propose a modification that makes the GCC phase transform (GCC-PHAT) method robust against environmental noise. Then, we use an iterative technique that employs the location estimate to improve TDOA accuracy. Extensive single-source experiments on both simulated and real data show that the proposed methods significantly improve TDOA accuracy and, consequently, source location estimates.
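For orientation, the standard GCC-PHAT delay estimate for one microphone pair can be sketched as follows. This is only the textbook baseline, not the noise-robust modification or the iterative refinement proposed in the paper, whose details the abstract does not give. The function name `gcc_phat`, its parameters, and the small regularization constant are illustrative assumptions.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None):
    """Baseline GCC-PHAT (hypothetical helper, not the paper's method).

    Returns the estimated delay of `sig` relative to `ref`, in seconds.
    """
    n = len(sig) + len(ref)            # zero-pad so the correlation is linear, not circular
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    cross = SIG * np.conj(REF)         # cross-power spectrum
    cross /= np.abs(cross) + 1e-12     # PHAT weighting: keep phase, discard magnitude
    cc = np.fft.irfft(cross, n=n)      # generalized cross-correlation over all lags

    max_shift = n // 2
    if max_tau is not None:
        # Restrict the search to physically plausible lags (at least one sample).
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(cc)) - max_shift
    return shift / float(fs)
```

In use, `max_tau` would typically be set to the microphone spacing divided by the speed of sound, so that only physically possible delays are searched. In noisy or reverberant recordings the peak of this baseline correlation can be spurious, which is the failure mode the abstract describes.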
Related resources
Approaches for Time Difference of Arrival Estimation in a Noisy and Reverberant Environment
Determining the spatial position of a speaker is of growing interest in video conferencing scenarios, where automated camera steering and tracking are required. As a preliminary step for localization, a microphone array can be used to extract the time difference of arrival (TDOA) of the speech signal. The direction of arrival of the speech signal is then determined by the relative time delay be...
Verified speaker localization utilizing voicing level in split-bands
This paper proposes a joint verification-localization structure based on split-band analysis of speech signal and the mixed voicing level. To address the problems in reverberant acoustic environments, a new fundamental frequency estimation algorithm is proposed based on high resolution spectral estimation. In the reconstruction of the distorted speech this information is utilized to reduce the ...
Concurrent speaker localization using multi-band position-pitch (m-popi) algorithm with spectro-temporal pre-processing
Accurate, microphone-based speaker localization in real-world environments, like office spaces or meeting rooms, must be able to track a single speaker and multiple concurrent speakers in the presence of reverberations and background noise. Our Multiband Joint Position-Pitch (M-PoPi) algorithm for circular microphone arrays already shows a frame-wise localization estimation score of about 95% f...
Robust Speaker Localization Utilizing a Novel Beamforming Algorithm Based on Harmonic Structures
Speaker localization by microphone array has recently received significant attention. Although various methods have been proposed, their performance with short data segments under noise and reverberation degrades considerably. Sound localization based on Steered Response Power (SRP) shows more robustness in practical situations, especially with the use of short data segments. In SRP-PHAT algorit...
Nonlinear filtering for speaker tracking in noisy and reverberant environments
This paper addresses the problem of speaker tracking in a noisy and reverberant environment using time delay of arrival (TDOA) measurements at spatially distributed microphone pairs. The tracking problem is posed within a state-space estimation framework, and models are developed for the speaker motion and the likelihood of the speaker location in the light of the TDOA measurements. The resulti...
Journal title:
- EURASIP J. Adv. Sig. Proc.
Volume 2011
Publication date 2011